Adapting ELM to Time Series Classification: A Novel Diversified Top-k Shapelets Extraction Method
Authors
Abstract
ELM (Extreme Learning Machine) is a single-hidden-layer feed-forward network in which the weights between the input and hidden layers are initialized randomly. ELM is efficient because it computes the weights between the hidden and output layers analytically. However, ELM does not produce semantically interpretable classification results. To address this limitation, we propose a diversified top-k shapelets transform framework, where shapelets are subsequences that serve as the most representative and interpretable features of each class. The most challenging problems we identified are how to extract the best k shapelets from the original candidate set and how to determine the value of k automatically. Specifically, we first define similar shapelets and diversified top-k shapelets in order to construct a diversity shapelets graph. Then, we propose a novel diversity-graph-based top-k shapelets extraction algorithm, named DivTopkshapelets, to search for the top-k diversified shapelets. Finally, we propose a shapelets-transformed ELM algorithm, named DivShapELM, that automatically determines the value of k and is then used for time series classification. Experimental results on public data sets demonstrate that the proposed approach significantly outperforms the traditional ELM algorithm in terms of effectiveness and efficiency.
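As a rough illustration of the two building blocks the abstract names, the following Python sketch combines a basic shapelet transform (the minimum sliding-window distance from each series to each shapelet) with a minimal ELM whose output weights come from a pseudoinverse. It is a sketch under stated assumptions, not the paper's DivTopkshapelets or DivShapELM: the shapelets are assumed to be given, the diversity graph and automatic choice of k are not reproduced, and the names shapelet_distance, shapelet_transform, and SimpleELM are hypothetical.

```python
import numpy as np


def shapelet_distance(series, shapelet):
    """Minimum Euclidean distance between a shapelet and any same-length window."""
    windows = np.lib.stride_tricks.sliding_window_view(series, len(shapelet))
    return float(np.sqrt(((windows - shapelet) ** 2).sum(axis=1).min()))


def shapelet_transform(X, shapelets):
    """Map each series to a vector of its distances to the given shapelets."""
    return np.array([[shapelet_distance(x, s) for s in shapelets] for x in X])


class SimpleELM:
    """Minimal ELM: random input weights, analytically solved output weights."""

    def __init__(self, n_hidden=20, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Hidden-layer activations; tanh is an assumed choice of activation.
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        # Input-to-hidden weights and biases are drawn randomly and never trained.
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        # One-hot targets, then a least-squares solve via the pseudoinverse.
        T = (y[:, None] == self.classes_[None, :]).astype(float)
        self.beta = np.linalg.pinv(self._hidden(X)) @ T
        return self

    def predict(self, X):
        return self.classes_[np.argmax(self._hidden(X) @ self.beta, axis=1)]
```

In this sketch, a classifier would be trained with SimpleELM().fit(shapelet_transform(X, shapelets), y), so each input feature is the distance to one shapelet and can be read as "how closely this series contains that pattern".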
Similar resources
Ultra-Fast Shapelets for Time Series Classification
Time series shapelets are discriminative subsequences, and their similarity to a time series can be used for time series classification. Since the discovery of time series shapelets is costly in terms of time, applying them to long or multivariate time series is difficult. In this work we propose Ultra-Fast Shapelets, which uses a number of random shapelets. It is shown that Ultra-Fast Shapele...
Channel masking for multivariate time series shapelets
Time series shapelets are discriminative sub-sequences and their similarity to time series can be used for time series classification. Initial shapelet extraction algorithms searched shapelets by complete enumeration of all possible data sub-sequences. Research on shapelets for univariate time series proposed a mechanism called shapelet learning which parameterizes the shapelets and learns them...
Alternative Quality Measures for Time Series Shapelets
Classification is a very broad and prevalent topic of research within data mining. Whilst heavily related, time series classification (TSC) offers a more specific challenge. One of the most promising approaches proposed for TSC is time series shapelets. In this paper we assess the current quality measure used for shapelet extraction and introduce two statistical tests into the context of shapel...
Scalable Discovery of Time-Series Shapelets
Time-series classification is an important problem for the data mining community due to the wide range of application domains involving time-series data. A recent paradigm, called shapelets, represents patterns that are highly predictive for the target variable. Shapelets are discovered by measuring the prediction accuracy of a set of potential (shapelet) candidates. The candidates typically co...
Mining time-series data using discriminative subsequences
Time-series data is abundant, and must be analysed to extract usable knowledge. Local-shape-based methods offer improved performance for many problems, and a comprehensible method of understanding both data and models. For time-series classification, we transform the data into a local-shape space using a shapelet transform. A shapelet is a time-series subsequence that is discriminat...